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Abstract — One of the main goals of extreme value analysis is to estimate the probability of rare 
events given a sample from an unknown distribution. The upper tail behavior of this distribution 
is described by the extreme value index. We present a new estimator of the extreme value index 
adapted to any domain of attraction. Its construction is similar to the one of Pickands' estimator. 
Its weak consistency and its asymptotic distribution are established and a bias reduction method 
is proposed. Our estimator is compared with classical extreme value index estimators through a 
simulation study. 



1 Introduction 

Suppose one is given a sequence Xi,...,Xn of independent and identically distributed (i.i.d.) 
observations from some distribution function F. Suppose there exist sequences a„ > and b„ and 
some ^ G R such that: 

max(Xi, . . . ,Xn) - b„ 



lim P 

n — *oo 



< X 



(1) 



with 6*^(2;) = exp[— (l+^x)^^^'"] if^ 7^ OandGo(a;) = exp[— e~^], where ?/+ — max(0, y). Necessary 
and sufficient conditions on F for the convergence Q to the extreme value distribution can 
be found in 15 . The aim of this paper is the definition of a new estimator of the extreme value 
index ^ G R. This parameter drives the decay of the tail distribution: as a power function if ^ > 
(Pareto, Burr, Student's, Log-gamma distributions, etc . . . ), exponentially if ^ = (Exponential, 
Normal, Log-normal, Gamma distributions, etc . . . ) and with finite right endpoint if ^ < 
(Uniform, Beta, Reversed Pareto, Reversed Burr distributions, etc ...). The knowledge of ^ is 
for example of high interest for extreme quantile estimation which arises in a lot of applications 
|12j such as finance, insurance, hydrology, etc . . . There is a substantial number of publications 
dedicated to the estimation of this extreme value index, especially on the heavy tailed distribution 
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context > 0) (see Beirlant et al. Feueverger and Hall jEl and, for a recent overview of 
this literature, see Csorgo and Viharos [H])- The most popular estimator in this case is the Hill 
estimator defined by: 

1 

ik,n = j:^^T^{Xn_i+i^n) " ln(X„_fc_„), for /c = 1 , . . . , n - 1 , 

where Xi^n < • • ■ < ^n,™ correspond to the random variables Xi, . . . , X„ rearranged in ascending 
order. The consistency and the asymptotic normality of this estimator are proved for example by 
Davis and Resnick Csorgo and Mason p], etc . . . 

The general case ^ € R has been less extensively studied. Dekkers, Einmahl and de Haan ^T] 
have adapted the estimator proposed by Hill to this situation. Another estimator was proposed 
by Pickands 

ik,n = T-py: In -T7 , for fc = 1, . . . , [n/4j , 

where \x\ denotes the integer part of x. Weak and strong consistency as well as asymptotic 
normality of ^ „ were established by Dekkers and de Haan 10 . Proofs are based on the following 
well known result: Let \J be the tail quantilc function of the distribution function F defined by 

U{x) = ^ ^ 

(the arrow means inverse function) and let 



1 - F(x) 



(pt ix) ^ J u* ^du, a; > 0, t G R. 

Relation Q holds if and only if (see de Haan there exist a positive measurable function a 
such that uniformly locally on x > 0, 

U(tx)-U(t) , , 
t^oo a\t) 

Clearly, relation |(2Jl implies that uniformly locally on x, ?; > 0, ?/ 7^ 1, 

U{tx)-U{t) ^ 

Thus, by substituting in ?7 by [/„ = [1/(1 — Fn)]^ {Fn denoting the empirical distribution 
function), t by n/{2k), x by 1/2 and y by 2, and remarking that C/„(n/fc) = Xn-k+i.m we have 
asymptotically 

n—^ Xn — k+l,n Xn — 2k+l,n -. 

^n-2fc+l,ra — ^ra-4fe+l,n 
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Pickands' estimator £^J^^ is the solution of the equation (@J. One can notice that this estimator 
does not take into account of the extreme observations Xn-k+2,71, • ■ • , Xn,n- 



In the next section, we define a new estimator of the extreme value index ^ when ^ G R. This 
estimator is similar to the one of Pickands but exploiting the information given by the spacing 
between Xn-k+i,n and X„^„. Weak consistency and asymptotic distribution are established in 
section 3 and a bias corrected estimator is introduced. Section 4 is devoted to the proofs of the 
main results and a simulation study is presented in section 5. 



2 Estimation of the extreme value index 

We propose to estimate the extreme value index ^ G R by ^k,n defined as the root of the equation 
in 6: 

(p0{l/k') \ Xn-k+l,n — Xn,n 



, - 1, for 1< fc' < /c < n. (5) 

(pe[i-/K) ) An-k' + l,n — ^n,n 

We can show (see Gardes [31 j Appendix B) that © admits an unique solution. This estimator 
applies to all real ^ and, as Pickands' estimator, remains unaffected when the scale or location of 
the data are changed. Furthermore, as we will see on a simulation study, the behavior of S^k,n is 
less influenced by the parameter k than Pickands' estimator. One can justify the definition of S^k,n 
by the two following lemmas: 

Lemma 1 Suppose that relation holds. Then, if ^ < 0, 

U{tx) - U(t) 1 
iim = — - 

t— ►00, 2;— >oo ^(t) S 

and if C > 0, 

U(tx) - U(t) 
hm ^ — — — = +00. 

t — >oo, X — >oo 



As a consequence of Lemma ^ we have: 

Lemma 2 Suppose that relation holds. Then, 



,ip^{x)j U{ty)-U{t) 
as i — > 00, a; — > with — > 00 and x/y — > d > 0. 



(6) 
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Lemma n and Lemma |2| can be seen as an extension of respectively (|2Jl and Q when x and y are 
going to zero or infinity. The proofs of Lemma^and Lemma|2]are postponed to the Appendix. By 
substituting in © J7 by [/„, thy x by l/k'{n) — 1/k' and y by l/k{n) = 1/k with fc/fc' c > 1, 
k oo and fc/n — > as n — > oo, we have asymptotically: 

which is an intuitive justification for the definition of i,k.n- The next section is dedicated to the 
study of (^k,n asymptotical properties. 

3 Main results 

3.1 Asymptotic properties 

We first state the weak consistency of i^k.n under some conditions on k and k' . 

Theorem 1 Suppose that relation |^ holds. If k/k' c > 1, fc — > oo and k/n ^ as n ^ oo, 
p 

then ^fe,„ — > 

Remark 1 Similar conditions on k are used by Dekkers and de Haan JHI to prove that Pickands' 
estimator is weakly consistent. 

To establish the asymptotic distribution of the estimator S^k.rn additionnary conditions are intro- 
duced. The first of them is a cornerstone in all proofs of asymptotic normality for extreme value 
estimators. 



{HI) — U has a positive derivative and there exist a slowly varying function I such that U'{x) — 
x^-H{x). 

We refer to ^ for more details on slow variation theory. The next condition controls the uniform 
rate of convergence of £{tx)/£{x) to 1 as x — > oo. Let S — min(— ^, 1/2) and introduce the random 



variables Kk 



(F=l-F). 



F{Xn-k+i.n)/ F{Xn.n) and Nn = l/F{Xn.n) where F is the survival function 



iH2) 



ipsik') sup 



£{tNn/Kk'. 



£{N„/Kk',n) 
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Our second main result is the following: 



Theorem 2 Let VkiO = ipsik)[{ln{k) - 1)I{^ > 0} + 1]. Under the conditions of Theorem[Tl(with 
k — ck') and if {HI) and {H2) are satisfied, we have for all i G R: 



lim V[Vk{mk,n-i)<t] = < 



exp(— e *) 
exp(— e^*/^) 



exp 



-[l + <ln(c)M(l/c)] 



$ ln(c)/(2^(7)] 



if < ^, 

if e = 0, 

if -1/2<C<0, 
if e<-l/2, 



(7) 



where a — c ^(c — 1)^^^ and <i> is the cumulative distribution function of the standard normal 
distribution. 

Remark 2 

i) Theorem 121 states that the asymptotic distribution of is Gaussian if ^ < —1/2 and an 
extreme value distribution if ^ > —1/2. If C = ^1/2, we prove that Vk{C}{^k.n — converges 
to a non-degenerate distribution with non explicit cumulative distribution function. In fact, 
as it will appear in the next section, the limit distribution of ^fc^„ is driven by k' /Kk'.n if 
C < -1/2, by r„ if ^ > -1/2 with 

Yn = Vk/a[(p!:{Kk',n/Kk,n) - fci'^/c)] 

and by both of them if ^ = —1/2. 

ii) (HI) and {H2) are second order conditions on the tail quantile function U. Similar conditions 
are used by Dekkers and de Haan JO] to establish the asymptotic distribution of Pickands' 
estimator. 

iii) Let ^(^) = 7l{^ > 0} - [1 - r(l - £_)]ip^{l/c)/\n{c)l{-l/2 < < 0}, where 7 is the Euler 
constant and F is the gamma function. Thcorcm[51cntails that Vk{£,){(,k,n — £,) converges to a 
distribution of mean /z(^) if ^ 7^ and ^ 7^ ^1/2. This suggests to define the bias corrected 
estimator: 



As we will see on a simulation study (see section 15.111 , this bias correction improves the 
behavior of our estimator in most finite sample situations. 
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3.2 Examples 

Let a, (3 > 0, 9 ^ IC\{0} and define ln2(x) = ln(ln(x)), a; > 1. The two following models of slowly 
varying functions £ are considered: 

£{x) = a + ex-f^ +o{x-^), (Model A), 

jdel B). 



eix)=9[Hx)]-^^^l + o(^-^'^Y (Mac 



Model A has been first introduced by Hall 18 . In both models, the parameter f3 tunes the decay 
of the slowly varying function £. The conditions that should be satisfied by models A and B to 
insure convergence (0 are given in Corollary^ In both cases, the best rate of convergence of ^k,n 
is also established. Some examples of distributions satisfying the assumptions of Corollary ^ are 
presented in Tabled In the sequel, the following notation is adopted. Let (u„) and (w„) be two 
non negative deterministic sequences. The notation u„ x means that 

< lim inf — < lim sup — < oo . 

Corollary 1 Suppose that k — ck', k ^ oo and that F satisfies assumption (HI) with a slowly 
varying function asymptotically monotone. 

i) If £ belongs to Model A and if ips{k') {n/k') ^ ^ Q then convergence (0 holds. In this case, 
the best rate of convergence of S,k,n is given by: 

ln(n) if < 

Vk{0- { W{n) if e = 0, 

^^5/[l+6/p\^e if^<o, 

where e g]0, 5/ (1 + 5/ P)[ is arbitrarly small. 

ii) If £ belongs to Model B and if (^^(fc') In(fc')/ ln(n) and (^^(fc') ln2(n)/ ln(n) then, 
convergence (0 holds. Furthermore, the best rate of convergence of S^k,n is given by: 

ln2(n) ifO<^, 

Vk{S.)^ { \nl{n) if e = 0, 

ln^""(n) if^<0, 
where e g]0, 1[ is arbitrarly small. 

Remark 3 This corollary points out the fact that the case ^ < is more favorable to our estima- 
tor, i.e. its convergence is faster than in the case > 0. This is illustrated by the simulation study 
(see section |5j. 
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Distribution 


Cumulative distribution 
function 


Model 


fi 


Best rate of 
convergence 


WeibullM 

(CeR) 


exph(l+ea^)-i/€], 
for X such that 1 + > 0. 


A 


1 


ln(n) if < ^ 
In^(n) if C = 

^S/(l+S/P)-e if ^ < 


Burr 

(e>o) 


1 - [wl{w + a;^)]^, for a; > 0, 
with w, A, r > 0, C = l/l^'r), 


A 


1/A 


ln(n) 


Frechet 

(e>o) 


exp(— a;~^/^) for a; > 0, 


A 


1 


ln(n) 


Weibull 

(^ = 0) 


1 — cxp(— Acc'^), for a; > 0, 
with A, r > 0. 


B 


1 - 1/t 


ln2(n) 


Normal 
(^ = 0) 


l/V2^e-*'/2d< 


B 


1/2 


ln2(n). 


Reversed Burr 

(e<o) 


1- K(w + (a:F-a;)-^)]^, 
for a; < a;^, with ^ = 1/(At). 
w, A, r > 0, xp G R. 


A 


1/A 


^S/{l+5/l3)-e 



Table 1: Examples of distributions satisfying the assumptions of Corollary ^ 
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4 Proofs of the main results 

This section is devoted to the proof of TheoremHand TheoremEl Proofs of lemmas are postponed 
to the appendix. 

4.1 Preliminary results 

The following function will play an important role. Let 

"""^"^^ - \ ^,(l/fc) / X„_fe, + i,„ - X„,„ - 1 ^x{l/k) + 
with Zn = {Xn-k+l,n — Xn-k' + l,n) I {Xn-k' + l,n — Xn^n)- 

Lemma 3 Under the conditions of Theorem 

i) Z„ = [U{N,jKk,r.) - U{N^/Kk'^n)]l[U{N,jKk'^n) - U{N^)]. 

ii) Nn ^ +0O, Kk,n ^ +00, Kk^n/Kk';n ^ c, Kk,n/Nn ^ and Zn ^ max(0, c"^ - 1) 
as 71 — )■ oo. 

iii) k' /Kk'.n Exp(l). If moreover k — ck' , then y„ A/'(0, 1). 

4.2 Proof of Theorem [T] 

We shall need the following result: 

Lemma 4 Suppose that relation |^ holds. If ^ > 0, for all rji g]0,^[, 772 > 0, there exist to, 
Pi, 02, /?!, /32 > such that, for all t > to and a; > 0, 

/3r.«-- - < < ;32X«-... _ 

a{t) 

If ^ = 0, for all 77 > 0, there exist to, P > such that, for all t > to and a; > 0, 

a{t) 

Proof of Theorem^ — We have to show that, for all e > 0, 

lim F[\ik,n - el > e] = 0. (8) 

n — >C30 

Remark that if ^ ^ 0, proving ^ for all £ > reduces to demonstrate for all < e < |^|. Since 
Hn is a non-decreasing function (see |14j . Appendix B) and since Hn{S,k.n) — 1, we have: 

nik,n - ^1 > e] = nHnii + £)<!]+ P[i?„(^ -£)>!]. 



To prove Theorem[T]it is sufficient to establish that P[i?„(^ + e) > 1] ^ 1 and P[iJ„(^-£) < 1] ^ 1 
as rt — > oo. The two following expansions hold: 

= 1 + fc-* - fc'-* + o(fc-*) if t > and ^^4^ ^ c* as n oo if i < 0. (9) 

The two following cases are considered separately: 

If g > , © and Lemma El ii) imply that H^i^ + e) = 1 + Z„ + fc-«+^) - fc'-(«+^) + op[/c-(«+^)] 
since ^ + £ > 0. Furthermore, from Lcmma|21i), 

d UjNjKk^n) - £/(A^n/j^fc'.n) a{N„/Kk',n) , . 

a{N^/Kk',n) U{N,JKk'.n)-U{N^)' ^ ' 

" V ' " V ' 

with 

(11) 

from (O and Lemma 01 ii) and 

PiKi-.:' - 01 < < p^KitT - /?2, (12) 

from Lemma^l Since, from LemmaOiii), k' /Kk>^n converges to a standard exponential distribu- 
tion, we deduce from that k'-^i+^'> /Z„ which entails that Hn{^ + e) - 1 ^ Zn > 
i.e. that P[i?„(C + e) > 1] ^ 1- Similarly, we prove that P[i?„(C - e) < 1] ^ 1- 
If ^ < 0, expansions (O and Lemma Oii) imply that -ff„(^ + e) — > > 1 since ^ + e > 0. Thus, 
P[Hn{£, + e) > 1] 1- In the same way, we prove that P[i/„(^ — e) < 1] ^ 1- ♦ 



4.3 Proof of Theorem El 

Let us define the function: 



1 + tx if t 7^ 0, 
e^ if t = 0. 

To prove Theorem (21 two auxiliary results are necessary. Lemma is dedicated to the study of the 
function ip^ . 

Lemma 5 

I) For X e (0,oo), iftlMx)] =a:*+i^*=°l. 
Let (u„) and be two sequences such that u„ ~ Vn (i.e. u„/u„ — > 1). 
II) Let t = 0. If i<n — > (X) and m„ — w„ ^ a then ip^{vn) ^ ipl{un)e~°' ■ 
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III) Let i 7^ 0. If w„ ^ (X) then ip^(vn) ^ (pl{un)- 

IV) Let t 0. If M„ ^ ^1/^ with w„ — + £„), then: 

i) If moreover e„/(^( (u„) ~ a„ where a„ does not converge to oo or to 1, then 

ii) If moreover e„/iy9j (u„) —>■ oo then ip1(vn) ^ —Sn- 

The proof of this basic resuh is not detailed here. Clearly, the distribution of Hn{x) is determined 
by Zn- The following lemma provides the asymptotic distribution of Z„. 



Lemma 6 Under the conditions of Theorem |21 



lim P 



exp(-t-i/C) 
exp(-t~i) 



if < 

if e = 0, 



= < 1 -cxp(-i-i/«) if-l/2<e<0, 



P[T < 



if e - -1/2, 



^ <i)H^e(i/c)V^H if e<-i/2, 

where the random variable T is defined as the limit in distribution of 



Y 



which is non-degenerate from Lemma |31 hi) 



Proof of Theorem H - Let J^„(t) = P[Vfc(0(6,n - < We have 
F^t) = P \ik.n < e + t/Vkid = nHni^ + t/VkiO) > 1] = P 



(1 + Z„)^^i^^^^^^>l 



since iJ„ is a non-decreasing function and since i?n(Cfc.n) = 1- Routine calculations yield: 



< u. 



with 



Remarking that is an increasing function for ^ > and decreasing for ^ < 0, we have. 



(-^^) > (fc')V|(in) 



if e > 0, 

if C < 0. 



(13) 
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The asymptotic behavior of the left hand side random term 

is given by LemmaEl Let us now focus on the right hand side deterministic term {k')^^^^^^^^ ip'^{tn) . 
Different cases have to be considered: 

If ^ > 0, the following sequence of asymptotic equivalences holds 



Since Vk{C) ^ ln(fc)/C, we have that fc*/^''^?) ^ e*« as n ^ oo. Thus, t„ ~ (fc')^/^e*^ — > c» as 
n -> oo. LemmaEllII) implies that (p|(i„) 1 + (fc')^e*« - (fc')^e*«. Thus, 



lim (fc')'-'^«="V|(in) = e*«. 



(14) 



If ^ = 0, we have 



Using the expansions. 



tn = ln(c) 



„t/\n^(k) _ ^t/ln^(fe) 
1 _ ct/ln'(fe) ' 



we find that 



In^(fc) Vln'*(fc) 



t„ = In(fc) 



andfc*/i"'W = l+t-47T+i'^ 



In(fc) 21n2(fc) Vln^(fc)/' 



t/2 - ln(c) 



In(fc) 



Since tn ^ oo and In(fc) — t„ ^ — </2 + ln(c) as n ^ cxd, LemmaUllI) implies that 
V>l{tn) ¥'|(ln(fc)) exp[t/2 - ln(c)] = /c'exp(t/2). 

Thus, 



lim (fc')'-'^^="Vf(in) = e*/2. 



(15) 



When ^ < 0, we have: 



Remarking that 



1 c-« - 1 



C*/V"fc(«)[l _ (fc')«+*/'^''«(«)]. 



^V.(o+''U.(oJ' 

and that (k'yl^^^^^ = 1 + o(l) lead to the foUowing expansion: 



ln(c) 



l + o 



^(0 



1 + t 



ln(c) 



[I - {k'f + o{k^) 
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with Un = -l/C[l + tln(c)/Vfe(0] and 

c ^ — 1 

Two situations have to be considered: 
If -1/2 < C < 0, it fohows that 



hm ' 



and Lemma |S1 IV) i) imphes 



2-«-l 



c 



and thus 



hm = 1 + ti^. (16) 

If ^ < -1/2, £n/9^|(wn) (1 - c~'^)~^ as n ^ cxD. Thus, Lemma IV) i) yields 

Hm (fc')'-'^«=°V|(in) = -|ln(c)ft^. (17) 

Collecting p;-{|l - 117|l with Lemma El concludes the proof. ^ 



5 Simulation study 

In this section, the improvement brought by the bias correction is illustrated through a simulation 
study. Next, a comparison with classical extreme value estimators is proposed. For each of the 
distributions considered in this section, N = 100 random samples of size n = 500 are generated. 

5.1 Bias corrected estimator behavior 

We first study the behavior of the bias corrected estimator ^ versus the estimator ^k.7i- In this 
aim, the following distributions are considered: (see Tabled for their parameterizations) 

• Case ^ > 0, Frechet distribution with ^ = 3. 

• Case g = , WeibuU distribution with (A, r) = (1, 1/2), (A, r) = (1, 1) and (A, r) = (1, 3/2). 

• Case -1/2 < ^ < 0, WeibullM distribution with ^ -1/3. 

• Case C < -1/2, WeibullM with ^ = -1. 
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In Figure^ the empirical mean over the N samples of Q „ and ^k,n is represented as a function of 
the number k of upper order statistics ( "Hill plot" ) . The true value of ^ is represented by a straight 
line. To compute these estimators, we choose c — k / k' = 4. If ^ > (Frechet distribution, Figure^ 
(a)), the behavior of £^k,n is improved by the bias correction. If ^ = (WeibuU distribution), the 
estimation is highly influenced by the parameter t which controls the rate of convergence of the 
slowly varying function £ (see table^. If r < 1 (Figuren(b), (c)), „ is less biased than £^k.n and 
if r > 1 (Figure n (d)), the bias correction does not improve the behavior of ^k.n- If — 1/2 < ^ < 
(Figure^ (e)), Q „ is slightly more biased than the estimator ^k,n- Finally, if C < ^1/2 (WeibullM 
distribution, Figurcn(f)), there is no correction {^k.n = £,1 «)■ As a conclusion, it seems that the 
bias correction improves (or at least does not really degrade) the behavior of our estimator. Thus, 
in the sequel, we focus on the behavior of „. 

5.2 Comparison with other estimators 

The estimator ^ is now compared with the following well known estimators: Pickands' esti- 
mator ^|'„, the moment estimator proposed by Dekkers, Einmahl and de Haan 11 and defined 
by: 

1 

1 



PA/ ^ Pff J- 1 _ 1 



i^k,n)'^ 



with 



Sk,n 

where Sk.n — ^/k^'^^i[ln{Xn-i+i,n) — ln(X„_fe.„)]^ and the generalized Zipf estimator [2] 
defined by: 

^ Hik + In UHj^n - I EU ln[(fc + EU 

Sfc.n , \ 2 ' 

(i ' \ 

The following distributions are considered: (see tabled for their parameterization) 

• Case ^ > 0, Burr distribution for which ^ = 1/(tA) with (/3, t. A) = (1, 1, 1). 

• Case ^ = 0, standard normal distribution. 

• Case -1/2 < ^ < 0, WeibullM distribution with ^ = -1/4. 

• Case C < -1/2, WeibullM with ^ = -2. 
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In Figures l2l4l the empirical mean and the empirical Mean Squared Error (MSE) of each estimator 
are represented as functions of k and we also choose c = 4. If ^ > (Burr distribution), „ is 
less biased than the other estimators (Figure 121(a)) but it suffers from a high variance (Figure |21 
(b)). If ^ = (Gaussian distribution, Figure |21 (c) , (d)), all estimates yield very poor results. If 
— 1/2 < < (WeibullM distribution, Figure [3 (e) , (f)), ^j! „ provides the best estimation and if 
^ < —1/2 (Figure 131, generalized Zipf estimator and ^j! „ are equivalent from the MSE point of 
view. 

Finally, let us focus on the influence of the rate of convergence of the slowly varying function on 
the estimation of ^. In this aim, we consider the reversed Burr distribution for which ^ = — 1/(At) 
(see Tabled for its parameterization). Here, the parameter A controls the rate of convergence of 
the slowly varying function (see Section rO|l . The larger is A, the slower i converges to a constant. 
This is illustrated in Figure^lfor xp — 10, w — 1, t — 1/A with A = 1, A = 2 and A = 3. In 
all cases, „ performed better than Pickands' and moment estimators and the best estimation is 
provided by the generalized Zipf estimator. 

As a conclusion, „ performes well in the WeibuU domain of attraction (^ < 0) and it is compet- 
itive with Pickands' and moment estimator if ^ > 0. 

Remark 4 The high volatility of the previous Hill plots points out the importance of the choice 
of the number k of upper order statistics. A lot of methods to select this parameter have been 
proposed (see Danielsson et al. [Jj, Gomes and Oliveira 16 , Guillou and Hall ^^Ij • . • ). A part of 
our future work will consist in the adaptation of these methods to our estimator. However, note 
that in all cases, our estimator is less influenced by this choice than other estimators. 
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Figure 1: Comparison of the bias corrected estimator ^ (solid line) and £^k.n (dashed line) for (a) 
the Frechet distribution, (b) the WeibuU distribution with (A, r) = (1,1/2), (c) the Weibull dis- 
tribution with (A,r) = (1, 1), (d) the Weibull distribution with (A,r) = (1,3/2), (e) the WeibullM 
distribution with ^ = —1/3 and (f) the WeibullM distribution with ^ = — 1. 

15 
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(a) mean 



(b) MSE 




100 150 



(c) mean 



(d) MSE 




100 150 



(e) mean 



(f) MSE 



Figure 2: Comparison of the estimator ^ (solid line), moment estimator (dashed line), Pickands' 
estimator (dotted line) and the generalized Zipf estimator (dash-dot line) for (a), (b) the Burr 
distribution with (/3,t, A) = (1,1,1), (c), (d) the standard normal distribution and (e), (f) the 
WeibullM distribution with ^ = — 1/4. 



(a) Mean 



(b) MSE 



Figure 3: Comparison of the estimator ^ (solid line), moment estimator (dashed line), Pickands' 
estimator (dotted line) and the generalized Zipf estimator (dash-dot line) for the WeibullM distri- 
bution with ^ = — 2. 



17 




50 100 150 200 250 50 100 150 200 250 



(a) mean (b) MSE 




50 1 00 150 200 250 50 1 00 1 50 200 250 



(c) mean (d) MSE 




(e) mean (f) MSE 



Figure 4: Comparison of the estimator ^ (solid line), moment estimator (dashed line), Pickands' 
estimator (dotted line) and the generalized Zipf estimator (dash-dot line) for the reversed Burr 
distribution with (a), (b) A = 1, (c), (d) A = 2 and (e), (f) A = 3. 
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APPENDIX 



Proof of Lemma First, we focus on the case ^ < 0. Recall that one can take in relation 
a(t) = ^[U{t) — xf] where xp is the right endpoint of the distribution function F. Thus, 

Ujtx) ~ U{t) ^1 XF- U{tx) 1 

a{t) e e[c/w-.^F] c ^ ' 

since xp > U{t) for all t. Furthermore, since U \a & non-decreasing function, we have that for all 
^ > and x>A, 

Ujtx) - U{t) ^ UjtA) - U{t) 
Using we have that for all A,e>0 there exist T such that for alH > T and x > A, 

i2i«> -1(1 (19, 

ait) C 

Using (|18|l and (|19|l . we conclude the proof for ^ < 0. Second, suppose that ^ = (the proof for 
^ > is quite similar). Using relation ^ we have that for all A,e > there exist T such that for 
alH > T and x > A, 

U{tx) ~ Ult) 



a{t) 

which concludes the proof. 

Proof of Lemma [2| — Remark that 

U{tx) - U{t) 
U{ty) - U{t) 

with 



> (l + £)ln(A), 



1 + z{t,x,y), 



_ U{tx)-U{ty) _ U{tyx/y)-U{ty) a{ty) 
Z[l, X, y) — TTTTT ~ TT"; ^ 



U{ty)^U{t) a{ty) U ity) ~ U {ty / y)' 

" V ' " V ' 

zi(t,x,y) Z2(t,x,y) 

Since y — > oo and x/y — > d, relation ^ implies that zi{t, x, y) converges to ^^{d). Since ?; — > oo and 
ty —> oo, Lemma^implies that Z2(t,x,y) converges to min(0,^). Thus, 1 + z{t,x,y) — > 1 if ^ > 
and l + — > if ^ < 0. Remarking that ip^{y) / (f^{x) — )■ 1 if ^ > and (p^{y)/(p^{x) c^^ 
if ^ < concludes the proof. l|k 

Proof of Lemma |3] — Let Wi , ■ ■ ■ , Wn be independent standard uniform random variables and 
Wi,n < . . . < Wn^n the Corresponding order statistics. 
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i) Remarking that 



(1 — Ki^n/Nn)i<i<n = iF{Xn-i+l,n))l<i<n = (W^ri-i+l,n)l<i<n, 

it follows that 

— (^ri-i+l,n)l<i<n, 

which conclude the demonstration. 

ii) We have Kk,n = Wk,n/Wi^n- Using Renyi representation |T] p. 72, Kk,n = Tk/Ti where Tk 
is the sum of k independent standard exponential random variables. Another use of Renyi 
representation leads to K^^n = l/W^i,fc-i- Since Wi^k-i as n — > oo, we prove that 
Kk,n ——^ cx) as n — > oo. Similarly, Nn — — ^ oo as n — > oo. 

Now, remark that Kk^n/ K^' = Wk,n/Wk' .n- Renyi representation yields Kk^n/ K^' .n == 
Tk/Tk' = {k/k')[Tk/k]/[Tk'/k'] ^ c as n ^ oo. The proof of Kk.n/N,, ^ as n oo is 
similar. 

Finally, Lemma [21 and Lemma 13 i) ii) imply that: 

ip^{l/k) 

which concludes the proof using k/k' — > c. 

iii) Since 1/Kk^n == W^i,fc-i 1 — W/c-i,fc-i, and remarking that 

W^fe_i,fe_i) -i^Exp(l), 

it follows that k/Kk.n has asymptotically a standard exponential distribution. 

Now, let . . . , En be independent standard exponential random variables and < • . . < 

En^n the corresponding order statistics. Dekkers and de Haan (Q^, Lemma 2.1) shows that 



I k d 

y -^—j{En-k/c+l,n - En-k+l,n " ln(c)) > 7V(0, 1). 

Since Kk,n = exp(i;„^„ - En-k+i,n) and Kk'^„ = exp(i;„,„ - En-k/c+i,n), the ^-method 
applied to the function ip^{e^^) concludes the proof. 4 



Proof of Lemma 131 — This proof is inspired by the one of [2], Lemma 0.13. We only give the 
proof for ^ > (the case ^ = is quite similar). For j € N, let 



f3j 



exp 



J 
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Using relation (O, we have that for any £G]0,1 — (1 + ^) ^[ there exist to such that for all t > to 
and for aU j e N\{0}, 

l~s<^M-«^<l+e. (20) 

Furthermore, since a{.) is regularly varying at infinity with index ^ (see |21| . Proposition 0.8 v) 
and Proposition 0.12), we have that for any e e]0, 1 — (1 + £,)^^[ there exist to such that for all 
t > to and for all j e N\{0}, 

Piil - e) = (1 + 0(1 - e) < -^1^ < (1 + 0(1 + e). (21) 



Let iV e N. We have: 



U{tf3N) - U{t) _ j2 ^(^/^j) " UWj-i) ajtPi-i) 



ait) a{tp,.i) a{t) 

Using |(2nj), we find that for t >to: 

(l-e)V^%il<^^i^M^<(l + OE^. (22) 



Remarking that 



a{t(3j_i) _ -Q a{t(3i) 



a{t) j-^^ a{t(3i^i) ' 
(PT|l and 123) imply that there exist /3i, /32 > such that for t >to: 

/3i[(l + 0(1 - £)]^ - /3i < ^(^^^) ; < /3,[(i + 0(1 + £)]^ - P,. (23) 

Let iV:r = [C/ln(l + 0]ln(2^) (i.e. x = exp[iVa;/^ln(l + 0])- Remarking that [N^l < < [N^l +1 
implies that /3[jv^j < x < /3[Ar^j+i- Since [/ is a non-decreasing function, we find that 

U{tpiN^i)-U{t) ^ Ujtx) - U{t) ^ U{tf3^N^i+,)-Uit) 
a{t) ~ a{t) ~ a{t) 

and, using 

/3i[(l + 0(1 - e)] L^^J - /3r < ^^'"^"^^^'^ < /^^[(l + 0(1 + e)] - /J^- 

Thus, there exist Pi and /32 such that: 

/3i[(l + 0(1 - £)]^^ - /3i < ^^'''^"^^'^ < /32[(1 + 0(1 + - 02. (24) 
Remarking that 
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with 771 =-^ln(l-£)/ln(l + ^) e]0,C[and 

[{I + + e)f^ = x^+^^ 
with rj2 = ^ ln(l + e)/ ln(l + ^) > concludes the proof. 



Proof of Lemma IHl— The first step of the proof consists in estabhshing the foUowing expansion: 



1 - 



1 +0p 



Let = i7(e*). From Lemma Eli), we have: 



^ d y'[ln(iV„/i^fe-,„)] C/(7V„/i^fc^„) - t/(iV„/i^fc,,„) 



V 



l/'[ln(A^„/iffe,,„)] 



Clearly, 



1 



^l,n Jo 

and conditions {HI) and (-ff2) imply that uniformly on [0, ln(/'irfe/.„)] 



ds, 



V'MNn/Kk',n)+s] 
V'[HNn/Kk',n)] 



1+Op 



1 



Thus, 



Zi, 



1 + op 



/•ln(^fc',„) 

io 



1 + op 



The proof of 

Z2,n ^ fi{Kk',n/Kk,n) 1 + Op 

follows the same lines. Collecting (|2fi|l and (|27|l yields 



ips{k) 



which proves (|25() by remarking that 



1 +Op 



= 1 - 



-Yn. 



(25) 



(26) 



(27) 



Now, remark that H25|) can be rewritten u„ = u„(l + £«) with m„ = (p^(Kk' ^n), Vn — ~ip^{l/c)/Z„ 
and 



+ op ( ^1 = Op +op(^]JUo, 



from Lemma El iii) • The second step of the proof is dedicated to the study of (p'^ (— (y5j(l/c)/Z„). 
Five cases have to be considered: 
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If ^ > 0, since u„ — ^ +00 as n 00 (Lemma |21iii)), Lemma El HI) entails that (^^(u„) ~ (p^{vn), 
i.e., 

(-^^) - ^*MiK..n)] = {K,,,^r'i^-'\ (28) 

by Lemma 01). 

If ^ = 0, we have u„ — > +00 as ?i ^ 00 and u„ — = u„e„ = op[ln(_R'fe',„) In(fc)] = op(l). Thus, 
Lemma mil) imphes (I28II . 

If — 1/2 < ^ < 0, we have from Lemma 13 ii) that Un ^ 1/C as n 00. Remarking that 
£n — op[l/ips{k)] — op(fc^) implies 



Op 



[(fc/iffe',„)«] =Op(l), 



which entails H28() by Lemma |5l IV) i). 

If ^ = —1/2, we have u„ — — 1/C as n ^ cx3. Remarking that £„ ~ — CT/[%/A:cpj(l/c)]y„ yields; 



where a„ does not converges in probability to cx) or 1 as n — > 00 (see Lemma |21iii)). Thus, from 
LemmaEllV) i), we have (/p|(w„) ^ (p|(u„)(l — a„) i.e. 



(29) 



p p / — p 

If ^ < —1/2, we have w„ — > ^1/C as n — > 00, e„ ~ — (7/[vfc</'5(l/c)]^n and En/ 'f*f{un) — > 00 as 



p 



n — ^ 00. Thus, Lemma |5l IV) ii) implies that ip*^{vn) ^ — i-e. 
Lemma 01 iii) and (PH|l - (pn)l conclude the proof. <|b 
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